How to perform the most accurate possible phase measurements 
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We present the theory of how to achieve phase measurements with the minimum possible variance 
in ways that are readily implementable with current experimental techniques. Measurements whose 
statistics have high-frequency fringes, such as those obtained from NOON states, have commensu- 
rately high information yield (as quantified by the Fisher information). However this information is 
also highly ambiguous because it does not distinguish between phases at the same point on different 
fringes. We provide schemes to eliminate this phase ambiguity in a highly efficient way, providing 
phase estimates with uncertainty that is within a small constant factor of the Heisenberg limit, the 
minimum allowed by the laws of quantum mechanics. These techniques apply to NOON state and 
multi-pass interferometry, as well as phase measurements in quantum computing. We have reported 
the experimental implementation of some of these schemes with multi-pass interferometry elsewhere. 
Here we present the theoretical foundation, and also present some new experimental results. There 
are three key innovations to the theory in this paper. First, we examine the intrinsic phase proper- 
ties of the sequence of states (in multiple time modes) via the equivalent two-mode state. Second, 
we identify the key feature of the equivalent state that enables the optimal scaling of the intrinsic 
phase uncertainty to be obtained. This enables us to identify appropriate combinations of states 
to use. The remaining difficulty is that the ideal phase measurements to achieve this intrinic phase 
uncertainty are often not physically realizable. The third innovation is to solve this problem by using 
realizable measurements that closely approximate the optimal measurements, enabling the optimal 
scaling to be preserved. We consider both adaptive and nonadaptive measurement schemes. 

PACS numbers: 03.65.Ta, 42.50.St, 03.67.-a 



I. INTRODUCTION 



The measurement of phase is an important task in both 
metrology and quantum computing. The measurement 
of optical phase is the basis of much precision measure- 
ment, whereas the measurement of the phase encoded in 
a register of qubits is vital to a broad range of quan- 
tum algorithms [3, [2, 3 ■ I n optical phase measurement 
the precision is usually bound by the standard quantum 
limit (SQL), where the phase uncertainty is 9(A r " 1 / 2 ) 
in the number of resources N [5l[ . On the other hand, 
the fundamental limit imposed by quantum mechanics is 
6(iV~ 1 ) 0,11, often called the Heisenberg limit. There 
have been many proposals to approach this limit. Ref. [i| 
proposed using squeezed states in one port of an interfer- 
ometer, as well as homodyne measurements, to beat the 
SQL. Measurements of this type have been experimen- 
tally demonstrated [H, 0, Q • Another type of nonclassical 
state that has been proposed |9| . llOfl and experimentally 
demonstrated [H Qj, M, Q \MM, [H GJ, OS M, El 
is the NOON state. These provide the maximum phase 
resolution for a given photon number, although they have 
the problem that they do not directly provide a unique 
estimate of the phase. 
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In quantum computing, the phase, corresponding to 
the eigenvalue of an operator, can be estimated using 
Kitaev's algorithm or the quantum phase estima- 
tion algorithm (QPEA) @, [||. The QPEA is based 
upon a pplying the inverse quantum Fourier transform 
(QFT) [21,111. Thc inverse QFT, followed by a computa- 
tional basis measurement, can be applied using just local 
measurements and control, without requiring entangling 
gates |Hj]. That simplification allows the phase mea- 
surement to be achieved optically, using only linear op- 
tics, photodetectors, and electronic feedback onto phase 
modulators [11. The optical implementation could use 
a succession of NOON states, or a succession of multiple 
passes of single photons (as was used in Ref. [11). Using 
NOON states, or multiple passes, results in high phase 
sensitivity, but an ambiguous phase estimate. The role 
of thc QPEA is to resolve this ambiguity. 

The minimum uncertainty for measurements of a phase 
shift is 0(-/V _1 ) in terms of the total number of applica- 
tions of that phase shift, N, regardless of whether those 
applications are applied in scries or in parallel pfjj . In 
quantum optical interferometry, N is the maximum to- 
tal number of passes of photons through the phase shift. 
In this formalism, a single pass of an TV-photon NOON 
state and TV passes of a single photon are regarded as the 
same number of resources. This is convenient because it 
enables physical systems that give mathematically iden- 
tical results to be treated within a unified mathemati- 
cal formalism. We emphasize that in practice, although 
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these resources are mathematically identical, they are not 
physically identical, and will be useful in different situa- 
tions. In particular, the resources for NOON states are 
used in parallel, which means that they are used within 
a short space of time. This is needed for measurements 
where there is a stringent time limit to the measurement, 
for example due to fluctuation of the phase to be mea- 
sured, or dccoherence of the physical system. In contrast, 
N passes of a single photon are using resources in series; 
i.e. the applications of the phase shift are sequential. This 
is useful for measurements of a fixed phase, where there 
is not an intrisic time limit, but it is required to mea- 
sure the phase with minimum energy passing through 
the sample. 

In this paper we examine the general problem of how 
to obtain the most accurate possible phase estimates, by 
efficiently eliminating ambiguities. This theory applies 
to general phase measurements in optics (with NOON 
states or multiple passes) and quantum computation, 
though for clarity we will primarily present the discus- 
sion in terms of NOON states. The problem with simply 
using the QPEA to eliminate phase ambiguities is that 
it produces a probability distribution with large tails, 
which means that the standard deviation is ©(iV -1 / 2 ), 
well short of the Heisenberg limit of ©(TV -1 ). A method 
of overcoming this problem was presented in Ref. [25| . 
which used an adaptive scheme to achieve @(A^ _1 ) scal- 
ing. This work was further expanded in Ref. [27j, which 
proved analytically that scaling at the Heisenberg limit 
can be achieved without needing adaptive measurements. 
References [25|, H3] demonstrated these schemes experi- 
mentally, using multiple passes of single photons. An al- 
ternative scheme based on adapting the size of the NOON 
states was proposed in Ref. [28[ . A method of eliminating 
phase ambiguities in the context of quantum metrology 
was provided in Ref. [29| . 

Here we present the theoretical foundations for Refs. 
[25l . [27j . with further analytical and numerical results, 
and some new experimental data. First, Sec. HIl explains 
the limit to the accuracy of phase measurement in more 
detail. Next, in Sec. IIIII the theoretical background to 
adaptive intcrfcromctric measurements and the relation 
to the QPEA is presented. The equivalent two-mode 
states when using repeated measurements are presented 
in Sec. HVl and it is shown that (with one exception) they 
have canonical phase variance scaling as the Heisenberg 
limit. The adaptive measurements are presented in Sec. 
IVl where the analytical results and numerical results for 
large N are given. In Sec. I VII it is shown that increasing 
the number of repetitions leads to a phase uncertainty 
that still scales as the SQL, rather than the Heisenberg 
limit. In Sec. IVIll the theory for some simplifications to 
the adaptive scheme is presented. These simplifications 
include a hybrid scheme as well as a nonadaptive scheme. 
The approach of adapting the size of the NOON state [28[ 
is given in Sec. I Villi Finally, we present the conclusions 
and a table summarizing the results in Sec. IIXI 



II. LIMITS TO PHASE MEASUREMENT 

The limit to the accuracy of phase measurements can 
be derived in a simple way from the uncertainty principle 
for phase [30l ] 

A0A?i>l/2, (2.1) 

where the uncertainties are quantified by the square root 
of the variance. For a single-mode optical field, n is the 
photon number, and the phase shift is given by the uni- 
tary exp(zn</>), with n the number operator. More gen- 
erally, one can consider a phase shift with n being any 
operator with nonnegative integer eigenvalues. The same 
uncertainty relation will hold, regardless of the particular 
physical realization. 

The uncertainty principle (|2.ip is exact if the vari- 
ance that is used for the phase is Vr = /-i~ 2 — 1, where 
[i = |(e'^)|, introduced by Holevo [3l|. The Holevo vari- 
ance coincides with the usual variance for a narrow dis- 
tribution peaked well away from the phase cut. Here (f> 
is an unbiased estimator of the phase, in the sense that 
e i(f> _ ^4>Y Note that the hat notation is used to indi- 
cate a phase estimator, rather than a phase operator. If 
one has a biased phase estimator, one must use instead 
(j, = (cos(0- 0)). 

If n is upper bounded by N, then the uncertainty in 
n can never exceed N/2. This implies that the phase 
uncertainty is lower bounded as 

A<j>>l/N. (2.2) 

Because this lower bound to the phase uncertainty may 
be derived from the uncertainty principle for phase, it is 
usually called the Heisenberg limit. This derivation was 
presented in terms of the standard deviation for the phase 
and the mean photon number in Refs. [32l . [33j . As ex- 
plained in [32L l33l | , that argument is not rigorous because 
the uncertainty relation (j2.ip is not exact for the usual 
standard deviation, and the uncertainty in the photon 
number is not upper bounded by the mean photon num- 
ber. The above derivation makes the Heisenberg limit 
rigorous by using the square root of the Holevo variance 
to obtain an exact uncertainty principle, and using an 
upper limit on n. 

The lower bound of 1/N on the phase uncertainty is 
not tight. The exact achievable lower bound, with opti- 
mal measurements, is 

A0HL = V^ = tan(^)~^. (2.3) 

The asymptotic result was found in the single-mode case 
in Ref. [34] . and the exact result was found in Ref. 
This bound in the case of two-mode interferometry, where 
N is the total number of photons, was found in Refs. 
[HI, ■ For two- mode interferometry, n is the number of 
photons passing through the phase shift. An alternative 
scenario for phase measurement was considered in Ref. 
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[26l |. There it was shown that the same limit holds in 
a general situation involving N applications of a phase 
shift to qubits, interspersed with unitaries. 

A unified way of defining N that is independent of 
the physical implementation is as follows. The complete 
measurement scheme, including any feedback, may be 
represented by preparation of a pure quantum state that 
depends on a parameter followed by measurement. De- 
note the family of such states, parametrized by 0, as 
| if}(4>)). The Fourier transform of this family of states 
may be taken independently of the basis, and can be de- 
noted \ip(s)). Then N can be defined as the minimum 
size of the interval that supports \ip(s)). See Appendix 
[C]for detailed explanations. 

Because of the way the Heiscnbcrg limit is derived, 
there are a number of conditions on its validity: 

1. The error is quantified by the square root of the 
variance of the difference between the phase esti- 
mate and the system phase. 

2. N is the total number of applications of the phase 
shift (the total number of photon passes in quan- 
tum optics). 

3. There is no a priori information about the system 
phase; the prior probability distribution is flat on 
the interval [0,2-71"). 

4. The variance is evaluated by averaging over all pos- 
sible system phases, using this flat distribution. 

Essentially these conditions require the measurement to 
be a self-contained measurement of a completely un- 
known phase. As these are the conditions on the Hcisen- 
berg limit, methods to achieve the Heiscnbcrg limit, or 
scaling as the Heiscnbcrg limit, should satisfy these con- 
ditions. For the measurement schemes that we describe 
here we are careful to ensure that all of these conditions 
are satisfied. It is possible to derive a similar limit on 
the measurement accuracy in the phase sensing regime, 
where it is required to measure small phase shifts [ill ]. 
That limit is also called the Heisenberg limit, though 
that definition of the Heiscnbcrg limit differs from the 
definition used here. 

The condition that the error is quantified by the square 
root of the variance is the most stringent condition on 
the error, because the standard deviation can be used to 
place bounds on all other commonly used measures of 
the uncertainty [38|. On the other hand, it is possible 
for other measures of uncertainty to give unrcalistically 
small values that arc not meaningful. In particular, the 
reciprocal-peak likelihood can give an uncertainty scaling 
as 1 /n 2 in the mean photon number [3^ | . That small un- 
certainty is not meaningful because it does not translate 
into a correspondingly small standard deviation [33] ■ 

For most other measures of uncertainty, if they are 
small then (even if the variance for a single measurement 
is large) results from separate measurements can be com- 
bined to yield an overall estimate with small variance. 



The problem is that this does not necessarily preserve 
the scaling; a fact which is often ignored in the analysis. 
If a single measurement uses N\ applications of the phase 
shift, and there are M repetitions of the measurement, 
then the total resources are N = MN\. The variance 
can then scale no better than l/(iVivM) = vM/N. 
Sometimes scaling as 1/ '(Niy/M) has been referred to 
as Heisenberg-limited scaling (for a recent example, see 
Ref. [13|). However, that is not strictly correct because if 
the number of repetitions M required increases with N\ , 
then the Heisenberg limit of 0(l/iV) scaling will not be 
obtained, as pointed out recently in Ref. [4l|. 

Note also that the first condition requires that the er- 
ror is measured between the actual system phase and the 
phase estimates. This prevents biased phase estimates 
giving unrealistically small values of the error. This con- 
dition also means that the uncertainty should not be 
based entirely on the probability distribution for the sys- 
tem phase based on the measurement results. That is, 
the uncertainty is the spread obtained in the phase es- 
timates from the measurements, rather than the spread 
in the Bayesian probability distribution obtained from a 
single measurement. 

The second condition is simply that the resources are 
quantified in the same way as in the proof of the bound. 
This means that the Heisenberg limit, as defined here, 
is not beaten by schemes such as those in Refs. jH, Hp], 
where the resources are quantified in terms of the num- 
ber of interacting systems. In these papers the number 
of applications of the phase shift is equivalent to z't || if || 
using their notation (see Appendix [Cj . The variance ob- 
tained in Refs. [42|, O is not smaller than 7r/(i/i||.ff||). 
Note also that, provided the resources are quantified in 
this way, it is not possible to obtain better scaling than 
1/N by combining parallel and serial resources. In par- 
ticular, for optics, N is the number of photon passes, so 
a state with n photons (the parallel resource) that passes 
through a phase shift p times (the serial resource) is us- 
ing total resources N = np. This means that it is not 
possible to obtain better scaling than 1/N by, for exam- 
ple, performing the phase measurements we propose with 
multiple passes of NOON states. 

The third condition means that no preexisting infor- 
mation about the phase can be used in the measurement. 
The fourth condition makes this rigorous by ruling out 
the possibility of just considering one system phase where 
the measurement is particularly accurate. This prevents 
the measurement from implicitly using information about 
the system phase. In contrast, work on phase measure- 
ment often considers the regime of small phase shifts, 
where the performance of the measurement is considered 
only for a small range of phases. This approach may 
be reasonable for states that are close to classical, but 
breaks down if all the resources are concentrated into a 
single nonclassical state in order to approach the Heisen- 
berg limit. For example, if all resources are concentrated 
into measurement with a single NOON state, then ac- 
curacy of Q(l/N) would be obtained only if the phase 



shift is smaller than 1/N; otherwise the phase estimate 
would be ambiguous. That is, for any fixed phase shift, 
no matter how small, it would not be possible to main- 
tain Heisenberg-limited scaling for arbitrarily large N. 
The phase measurement schemes that we present here 
show how to eliminate phase ambiguities to achieve ac- 
curacy 0(l/iV) without needing any initial knowledge of 
the phase. 

III. THEORETICAL BACKGROUND 
A. Interferometric measurements 

Interferometric measurements are typically considered 
via the Mach-Zehnder interferometer, as in Fig. [T] Two 
input modes are combined at a beam splitter, after which 
each of the modes is subjected to a phase shift, and the 
two modes are rccombined at a second beam splitter. 
The phase shift to be measured, <f>, is in one arm, and a 
controllable phase shift, $, may be added in the second 
arm. The first beam splitter is not necessary for the 
analysis, and it is more convenient to consider the state 
in the arms of the interferometer, i.e. in modes a and b. 

The optimal phase measurement, also called the canon- 
ical measurement, can be imagined to be performed di- 
rectly on the modes in the arms, thereby also omitting 
the second beam-splitter. The phase statistics of such 
a measurement can be regarded as the intrinsic phase- 
difference statistics of the state of these two modes. The 
joint state of the modes a and b (as shown in Fig. [T]) can 
be written as a superposition of joint number states. We 
make the usual assumption that the coefficients in this 
superposition are positive and real before application of 
any phase shift. If the total photon number is fixed at 
N, then the canonical positive operator-valued measure 
(POVM) is of the form @, E| 

1 N 

Fca„(0) = Y \$){4>\, \4>) = £ e*"*|n>|JV - n). 

71=0 

(3.1) 

This POVM gives the probability density for the contin- 
uous quantity </>; that is, the probability of the measure- 
ment result being in the interval A is 

P(4> G A) = f Tr[F cari [4>)p}4- (3-2) 

J A 

This measurement is applied to modes a and b after ap- 
plication of the phase shift </>, but without the phase shift 
$ (or with $ taken to be zero). The result of the mea- 
surement, </>, is then the estimator for the unknown phase 
<t>. 

Except in special cases, it is not possible to perform 
canonical measurements with standard optical equip- 
ment (photon counters and linear optical elements such 
as beam splitters). To approximate canonical measure- 
ments, Refs. (37l |38| use adaptive phase measurements, 
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FIG. 1: The Mach-Zehnder interferometer, with the addition 
of a controllable phase $ in one arm. The unknown phase to 
be estimated is <j>. 

an idea introduced in Ref. The sequence of detec- 

tions is used to obtain a Bayesian probability distribution 
for the phase. The phase <E> is then adjusted to minimize 
the expected variance after detection of the next photon. 
Note that the time between detections gives no informa- 
tion about the phase. In addition, for fixed total photon 
number, N , the measurement operator for no detection 
does not alter the state. This means that the time be- 
tween detections can be ignored in the analysis. 

To obtain the probability distribution for the phase, it 
is convenient to keep a record of the unnormalized sys- 
tem state. The unnormalized system state after m detec- 
tions, and for system phase <p, will be denoted \ip(u m , <f>)). 
Here u m = (ui, . . . , u m ) is the vector of m measurement 
results. The measurement results are taken to be Uj = 
or 1 to indicate that the jth photon is detected in mode 
Co or mode c\. The annihilation operators at the output 
of the final beam splitter, Co and Ci, are 

c = [e < *o+(-l)V*6]/V2, (3.3) 

where a and b are the operators for the field at the po- 
sitions shown in Fig. Q] After detection, the state is up- 
dated as 

Mtk.+i, <t>)) = feH V>(Mm,0)}- (3-4) 
y/N — m 

The probability for the sequence of measurement results 
is given b y P(u m \<j>) = {ip(u m , 4>)\ij){u m , tj>)). (The divi- 
sion by yN — m ensures that the probabilities for the 
different measurement results sum to 1.) 

Using Bayes' theorem, the probability for the system 
phase given the measurement results is 

P{(j>\u m ) = ^uT\ ■ (3-5) 
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The initial probability distribution, P(</>), is flat, and the 
probability P{u m ) is independent of the phase. Therefore 
the probability distribution for the system phase is 

P{(f)\u m ) (x P(u m \cj>) = (ip(u m ,(j))\tp(u m ,(j))). (3.6) 

Because the annihilation operators c u contain the ex- 
ponential e 1 ^, the unnormalized state and therefore the 
probability can be expressed in terms of powers of this 
exponential. This enables the probability distribution to 
be efficiently represented in terms of the coefficients of 
the powers of e 4 *. 

The unbiased phase estimate with the smallest vari- 
ance is 4> = arg(e^) [38|. Here the average is taken over 
<t>, for the Bayesian probability distribution. If the prob- 
ability distribution P((f>\u m ) is represented as a Fourier 
series in <fi, (e 1 ^) is simply the coefficient of the e~ l< ^ term. 
Recall that the Holevo variance in the phase estimates is 



defined as Vh = /i 



1, where /i = 



J<t>\ 



In order 



to ensure that the measurement is covariant, we take the 
initial value of $ to be random. Then the value of the 
sharpness [i is given by [38[ 



-E 



^P(u m \$)dc$> 



See Appendix [B] for a more detailed explanation. 

This expression can be rewritten as an average over 
the sharpnesses of the individual Bayesian probability 
distributions 



H = E 



J<t>\ 



(3.8) 



Here E indicates an expectation value over the measure- 
ment results and the initial feedback phase, and the angle 
brackets indicate an average over <f>. The way this expres- 
sion is interpreted is that, for any actual system phase 
and initial feedback phase, we obtain some measurement 
results u m and determine a Bayesian probability distri- 
bution for the system phase based on those measurement 
results. 

In Refs. [13, [HI an algorithm for choosing the feedback 
phase was introduced, which maximizes the expected 
sharpness of the Bayesian probability distribution after 
the next detection. Maximizing the average sharpness, 
as in Eq. (|3.8p . minimizes the variance in the phase es- 
timates. Therefore this feedback minimizes the variance 
in the phase estimates after the next detection. It might 
be thought that, since the ultimate aim is to minimize 
the Holevo variance Vh, the feedback phase should be 
chosen to minimize the expected Holevo variance of the 
Bayesian probability distribution after the next detec- 
tion. However, the variance we want to minimize is that 
in the phase estimates, not that in the Bayesian proba- 
bility distributions. If we were minimizing the variance 
in the Bayesian probability distributions, then we would 
be minimizing 



E[|<< 



1] 



We actually want to minimize the variance in the phase 
estimates, which is given by 



tfe = {E[Ke*)|]} -1. 



(3.10) 



Thus, to minimize Vh, or indeed any monotonically de- 
creasing function of fi, for the measurement scheme as a 
whole, one should aim to maximize Eq. (|3.7|) . as in Refs. 

Mm- 

Alternatively, the result that one should aim to max- 
imize Eq. (|3.7p may be shown directly from Eq. (|3.7p . 
The feedback phase $ m for detection m is a function of 
the prior measurement results Ui, . . . , u m _i and the ini- 
tial feedback phase $j. In order to maximize [i as given 
by Eq. (|3.7p . we need to optimize each of these $ m for 
each measurement record u m _i. We can rewrite the sum 
for fi in Eq. (|3~7)) as 



sir. 



e^P(u m |0)# 



(3.11) 



(3.7) Then for each u m —\ we have a sum 



l u m -i — 



u m =0 



(3.12) 



(3.9) 



that depends only on the feedback phase $ m for that 
Um-i- Thus the multivariable maximization is reduced 
to separate maximizations of a single variable for each 
iim-i- As there arc only two terms in the sum for fJ,n m _ 1 , 
there is an analytical solution for the feedback phase <E> m 
which maximizes this function, as detailed in Ref. [38j . 

In Ref. [38| it was shown that this approach can, in the- 
ory, yield an uncertainty only slightly larger than (|2.3|) 
with the optimal input states. The problem is that these 
states have not yet been produced experimentally. The 
simplest A-photon input state to realize in practice is 
that with all N photons entering one port of the first 
beam-splitter in Fig. [T] (with vacuum at the other port). 
In this case, with the single-pass interferometer design of 
Fig-HJ the variance in <f> can scale no better than 0(A _1 ) 
(corresponding to an uncertainty of 0(A -1 / 2 )). This 
limit can not be surpassed regardless of how effective the 
adaptive measurements are, or even with canonical phase 
measurements. To beat this scaling it is necessary to use 
nonclassical multiphoton states, or a variation in the in- 
terferometer design that achieves the same end. The non- 
classical states with the highest resolution, NOON states 
Q, give a Holevo variance that is formally infinite, due 
to the manifold ambiguity in the phase estimate. The 
same is true if one simply increases the number of passes 
through the phase shift. Practical solutions to this prob- 
lem [25I [27j have come from considering phase measure- 
ments in quantum computing, which are discussed in the 
following subsection. 
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B. Phase measurements in quantum computing 

In quantum computing, phase estimation is used for 
finding an eigenvalue of a unitary operator U. If an 
cigcnstatc, \u), of U is known, the task is to find the 
corresponding eigenvalue e 1 *. A control qubit is placed 
in the state (|0) + \ l))/y/2, and target qubits are placed 
in the state \u). A controlled-?/ 2 operation (i.e., an op- 
eration that applies the operator U 2 ^ to a target qubit 
conditional on the state of the control qubit) then trans- 
forms the control qubit into the state (|0) + e i2 ' rt '\l)) / y/2. 
One approach to finding the phase is the quantum phase 
estimation algorithm (QPEA) @, Q , in which one per- 
forms this procedure on identically prepared qubits for 
k = 0, . . . ,K Q. This yields a state of the form 

2 K+l_ 1 

E e ^lf). ( 3 - 13 ) 

y=o 

where \y) is a state on the K+1 control qubits. Provided 
the phase has an exact if -bit binary expansion of the 
form 4> = 27rx O.ao ■ • • ax (where the a.j are binary digits), 
performing the inverse QFT yields the state |eto ■ ■ ■ clk), 
so measurement in the computational basis yields the 
digits of the phase. If the phase is not of the form <fi = 
27r x O.ao • • • ax, then accuracy to this number of digits 
(with success probability 1 — e) can still be achieved by 
increasing K to K> = K + 0(ln(l/e)) 0. 

An alternative approach was given by Kitaev Q. 
There, rather than using the inverse QFT, separate mea- 
surements are used. For each k the number of qubits 
used is 0(\n(K/e)), which enables 2 k (f> to be localised 
in one of 8 subintcrvals of [0, 2ir] with error probability 
< e/l. By combining measurements for different values 
of k, the value of 4> is determined with precision ir2~ K ~ 2 
and error probability < e. In this paper we mostly take 
inspiration from the QPEA, but in Sec. lVIIBl we also use 
ideas from Kitaev's algorithm. Note that the QPEA was, 
rather inaccurately, referred to as the Kitaev algorithm 
in Ref. (H). 

Quantum computing algorithms that require phase es- 
timation are for discrete problems, which arc typically 
quantified by the error probability (i.e., the probability 
that an incorrect answer is given). For such problems, we 
usually require that the phase error is smaller than a cer- 
tain amount in order to obtain the correct final answer. 
If the phase error is larger than this, its magnitude is ir- 
relevant. We contrast this situation to the case in optics, 
where phase estimation is typically treated as a continu- 
ous problem for which the magnitude of any error is also 
important. Therefore, the quality of a phase estimation 
measurement is better quantified by the variance rather 
than a confidence interval. Also, in quantum computing 
it is often possible to implement the controlled unitary 
operation efficiently for arbitrary k, so the resources used 
scale as K. In contrast, for optics implementing a mul- 
tiple of the phase shift requires more resources, so the 



total resources used scale as 2 . 

To implement quantum-computing style phase mea- 
surements in quantum optics, the multiples of the phase 
shift can be achieved using either NOON states or mul- 
tiple passes through a phase shift. The inverse QFT can 
be implemented by using the simplification of [24| which 
requires only local measurements and control. 

Using a single photon as the input to a Mach-Zchndcr 
interferometer, the state in the arms of the interfer- 
ometer is (|0) + |1))/V2, where |0) and |1) are the 
states corresponding to the photon in one arm or the 
other. Normally the phase shift </> changes the state to 
(|0)+e^|l))/\/2. With v passes through the phase shift, 
the state is changed to (|0) + e""^|l))/\/2. The beam 
splitter then acts as a Hadamard operator, and the pho- 
todetectors give a measurement in the computational ba- 
sis. With a controllable phase of $ in the other arm, the 
probabilities of the measurement results are 

P{u\<f>) = - [1 + (-l) u cos(W - *)]■ (3.14) 

Alternatively, using a NOON state (|j/0) + \0u))/V2 in 
the arms of the interferometer, the phase shift changes 
the state to {e iv *\vQ) + \0v))/V2. Until detection of all 
v photons, no phase information is obtained, so there is 
no information on which to base a feedback phase. After 
detection of all photons, the probability of the sequence 
of measurement results is 

P{u v \4>) = ^{1 + (-l) u i+-+"» cosK^ - $)]}. (3.15) 

This means that the probability distribution for the 
phase, given the measurement results, only depends 
on the parity of the measurement results; i.e. whether 
u± + . . . + u v is odd or even. The phase information ob- 
tained is identical to that for a single photon with multi- 
ple passes, with the single measurement result u replaced 
with the parity, and the controllable phase <E> replaced 
with From this point on the parity obtained from 
the NOON measurement will be represented as a single 
measurement result, u. 

To apply the phase measurements from quantum com- 
puting theory to optics, it is necessary to use multiple 
time modes. That is, independent single photons are 
used, or NOON states that are sufficiently spaced apart 
in time that the photons from the different NOON states 
can be unambiguously distinguished. The two cases are 
mathematically equivalent. For brevity, for the remain- 
der of this paper, we present the analysis in terms of 
NOON states, but the analysis also holds for multiple 
passes of single photons, as well as phase measurements 
in quantum computing, because they are mathematically 
equivalent. The techniques we present can also be ap- 
plied to Hamiltonian parameter estimation, as discussed 
in Appendix [Cl though there is the complication that the 
range of the parameter is not limited to [0, 2tt). Multiple 
passes of single photons were used in the experimental 
demonstration below and in Refs. [25l. [27|. 
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NOON states are often said to provide super-resolution 
of phase, due to the multiple peaks in the phase distri- 
bution obtained in the interval [0, 2tt]. If a measurement 
is performed with just a single NOON state, there is a 
problem in that it is not possible to distinguish which of 
these peaks corresponds to the phase. This results in the 
Holevo phase variance being infinite. To obtain a use- 
ful phase estimate, the phase information from a NOON 
state needs to be combined with other phase information. 

The QPEA, with the QFT implemented according to 
the scheme of [Jlj . provides a method to combine the 
phase information from different NOON states to yield 
an unambiguous measurement of the phase. Initially the 
controllable phase $ is set to be zero. The first measure- 
ment is taken with a NOON state with v = 2 K . Provided 
the system phase has an exact K-bit binary expression 
of the form <f> = 2it x O.ao • • • a^, this measurement yields 
the value of a,K as the first measurement result, u\. Given 
this measurement result, the feedback phase is adjusted 
by uitt/2 k such that <\> — $ = 2n x 0.a • • • clk-i- Then 
the next measurement with v = 2 K ~ 1 yields the next 
digit ok_i, and so forth. In this way, all digits ak are 
determined with certainty. 

This procedure gives a measurement described by a 
POVM of the form 



F^ l ) = \ ( / )l }(M, \4>i) 



i 



(3.16) 

where N = 2 K+1 - 1, and fa = nl/2 K , for I = 0, . . . , N. 
The state \n, N — n) is labeled by the total number of 
photons in each arm. That is, \n, N — n) = |n)®|JV — n), 
where 



|n) = |n ) ® .. . ® \n K ), 



(3-17) 



and where the are the binary digits of n, and similarly 
for \N-n). 

Since the QPEA gives a phase estimate with K bits 
of accuracy, at the cost of 2 K resources, it would seem 
that it should enable phase variance near the Heisenberg 
limit. Surprisingly, this is not the case. In fact, it gives a 
variance above the SQL, as we explain in the next section. 



IV. EQUIVALENT STATES FOR MULTIPLE 
TIME MODES 

To assist in determining the phase variance yielded by 
phase measurement algorithms, we introduce the concept 
of equivalent states. In general, consider two different 
systems with respective sets of basis states {|Xn)l and 
{|£„)}, such that a phase shift <fi adds a factor of e m ^ to 
each basis state. States in the different systems that are 
identical except for the basis may be regarded as equiv- 
alent. That is, Y, n tp n \Xn) is equivalent to J2 n V'nl^n)- 
The canonical measurements are identical except for the 
basis states used, and the distributions obtained for the 
phase are identical. 



In order to analyze a state with multiple time modes, 
we determine the state with a single time mode that it 
is equivalent to. We call this the two-mode equivalent 
state, because it has two spatial modes and just a single 
time mode. The sequence of K NOON states with the 
photon number increasing from 1 to 2 K by powers of two 
is 

W*) = 2( a-|i)/2 (I 2 *' ) + |0,2*»®...®(|1,0) + |0,1» 

(4.1) 

This state is changed under the phase shift to 



10,2 



K< 



...®(e**|l J 0) + |0,l». (4.2) 

Expanding the product, there is a sum of orthogonal 
states such as \2 K , 0) <g> |0, 2 K ~ 1 ) ® . . . ® |1, 0). Each state 
has a coefficient of e m ^, where n is the sum of the photon 
numbers in the first modes from each pair of modes. The 
state (|4.1[) is therefore equivalent to, in terms of its phase 
properties, an equally weighted superposition state with 
one time mode 



1 N 
2(*"+i)/2 51 

71 = 



n, N ■ 



(4.3) 



with N = 2 K+1 - 1. 

In terms of these equivalent two-mode states, the 
POVM for the QPEA can be written as in Eq. i[3T6]) . 
with the states now being the equivalent two-mode states. 
This POVM is close to a canonical measurement, ex- 
cept that the possible phase measurement results are re- 
stricted to cf>i = irl/2 K . To obtain a canonical measure- 
ment, the initial feedback phase $ x can be taken to be 
random, rather than zero. For a particular value of this 
initial feedback phase, the POVM is the same as that in 
Eq. (|3.16[) . except with <pi = $i + nl/2 K . Using random 
$i, all <j> are included in the POVM, so the measure- 
ment becomes the canonical phase measurement as in 
Eq. (51) . 

Using the canonical measurement, the value of fj. for 
this equal superposition state is fi = N/(N + 1) = 1 — 
2~( K+1 \ Therefore, the Holevo phase variance is 



1 



(1 - 2~( K + 1 )) 2 



1 



2 

N 



1 



(4.4) 



This scales as 1/N which is the SQL, rather than the 
expected Heisenberg limit. To see how this is compatible 
with the precision of the QPEA one must examine the 
probability distribution for the phase estimate. Using \n) 
to denote |n, N — n), this distribution is given by 



1 



P ^ = 2, 




sin 2 [(V + l)(0-0)/2] 
27r(V + l)sin 2 [(0-(/))/2]' 



(4.5) 
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Clearly, this distribution has a peak (around (f>) with a 
width 0(N^ 1 ), as expected for scaling at the Heisenberg 
limit. But its variance scales as the SQL because of its 
high tails. This is most e asily seen using the Collett phase 
variance Vc = 2(1 — fx) [47]. The Collett variance coin- 
cides with the Holevo variance whenever either is small 
[HI . Using x = </> — (f) for the phase error, we have 



y c = (4sin 2 [(0-0)/2]) 



sm 2 (Nx/2)dx. 

(4.6) 

That is, the tails are so high that the variance integral is 
finite only because of the finite range of the phase. The 
integral evaluates to 2/(N + 1), the same SQL scaling as 
found above for Vh- 

We have already seen that the QPEA realizes a canon- 
ical phase measurement. Therefore if it fails to attain the 
Heisenberg limit, the fault must lie with the state (|4.3|) . 
One approach is to replace the state (|4.3j) with a more 
general state of the form 



N 



y^-0 n |n, N -n). 



(4.7) 



n=0 



Because the measurement is canonical, provided the co- 
efficients V^n are optimal the minimum phase uncertainty 
(|2.3|) could be obtained. 

Unfortunately the minimum phase uncertainty state 
[34l . [3BI l36l . [371 is not separable between the different 
time modes, and would therefore be extremely difficult 
to create. However, it is not necessary for the state to 
be a minimum phase uncertainty state in order to obtain 
a phase variance scaling as the Heisenberg limit. For a 
state 2nLo V'nl 71 -! N ~ n )i the phase variance Vq is given 
by 



N 



2(1-|(^)|)= £ (ifrn-Vn 



+1 



(4. 



where ij)—-y and ipN+i are defined to be zero. Provided the 
state coefficients vary relatively smoothly, the maximum 
value of ijj n should be of order 1 / VN, and the successive 
differences should be of order A -3 / 2 . This implies that 
(tp n — ipn+i) 2 should be of order A -3 , so the variance 
should scale at the Heisenberg limit of N~ 2 . This rea- 
soning shows that the restrictions on the properties of 
the state are quite weak if one wishes to obtain scaling 
at the Heisenberg limit. 

The equally weighted state does not give phase vari- 
ance scaling at the Heisenberg limit because there is a 
large jump, of order l/y/N, between ip-i and ipo and 
between if)pf and ipM+i- In order to obtain the required 
smoothness, one approach is to use multiple copies of the 
state. The resulting equivalent two-mode state is then 
the repeated convolution of the original state (in terms 
of the squared state coefficients). The convolution has 
the general property of smoothing functions, and there- 
fore can be expected to yield scaling at the Heisenberg 
limit. In particular we can prove the following theorem. 



Theorem 1. The state 

m = w, 



Lp 7 Jx><"*|„,w K -„}) 



, (4.9) 



has a canonical phase variance of Q(hi Nk /N k ) for M ~ 
2, and Q(l/N%) for M > 2. 

This theorem means that for fixed M greater than 
2, the canonical phase variance scales as l/N 2 , where 
A = NkM . The quantity M is the number of copies 
of the state. The way we achieve a state equivalent 
to (|4.9[) is by using M copies of each of the individ- 
ual NOON states. In applying this theorem, we take 
Njc = 2 K+1 — 1, though that is not required for the 
theorem. A similar behavior with number of copies was 
found in Ref. [H[ for canonical multimode phase estima- 
tion (modulo 7r) on multiple copies of a squeezed vacuum 
state, where M > 4 copies were required to attain the 
Heisenberg limit. There the optimal number of copies, in 
terms of minimizing the canonical Holevo variance for a 
fixed mean photon number, was 8; in our case it is 3 (see 

Fig. p. 

Proof. Here we give a nonrigorous derivation of the scal- 
ing using a continuous approximation. For the rigorous 
proof of the theorem without using this approximation, 
see Appcndix[A] The state (|4. 9[) is equivalent to the state 



1 



N K 



(N K + l) M l- 



(4.10) 



where /m(^) is the number of combinations of values 
of n\,ri2, . . . , um that sum to n. This quantity can be 
regarded as the number of points in a hyperplane per- 
pendicular to a line running between opposite corners of 
a hypercubic lattice of dimension M. 

In the continuous approximation, the value of /m(^) 
is the area of the cross section of the hypercube, and is 
equal to n M ^ 1 /(AI - 1)! for n < N K . The contribution 
to the variance Vc for n < Nk is then, in the continuous 
approximation, 



1 



(N K + 1) 
1 



M 



Nk I dyT^n)' 



dn 



{N K + 1) 



,A/-3 



4[(M-2)!] 2 



dn 



dn. 



(4.11) 



There is a clear difference between the cases M = 2 and 
M > 2. For M > 2, the integral gives an expression 
proportional to N^^ 2 , so overall the expression scales 
as Q(l/N%). On the other hand, for M = 2, the inte- 
grand is proportional to 1/n, rather than of a positive 
power of n or a constant. The divergence at n = can 
be ignored, because the continuous approximation breaks 
down. However, the integral yields lnA^ from the up- 
per bound, which means the expression overall scales as 
0(ln Nk/N k ). 
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The contribution to V c for N - N K < n < N is the 
same. For Nk < n < N — Nk, we use the fact that 
f' u {n) < N K '~ 2 and f M (n) > i\#~7(M - 1)! to find 
that the integrand is upper bounded by (M - 1 )\N%~ 3 . 
This means that the contribution to Vc for Nk < n < 
N - N k is 0(1 /N K ) for both M = 2 and M > 2. Hence 
the canonical phase variance, in the continuous approx- 
imation, is e(hxN K /N K ) for M = 2 and @(1/N K ) for 
A/ > 2. □ 



V. ADAPTIVE MEASUREMENTS FOR 
MULTIPLE TIME MODES 

A. Deriving the recurrence relation 

The drawback to using these multiple copies of the 
NOON states is that the canonical measurements can 
no longer be achieved exactly using adaptive measure- 
ments. On the other hand, it is possible to approximate 
the canonical measurements for larger numbers of rep- 
etitions, M. The idea is to use a generalization of the 
adaptive phase measurements for a single time mode. As 
in that case, at each step the feedback phase $ is chosen 
to minimize the expected variance after the next detec- 
tion. 

One starts with performing M measurements, one on 
each of M NOON states, each with photon number 
v = 2 A for some integer K. Next, measurements are 
performed on M NOON states with v = 2 K ~ 1 1 and one 
continues this sequence on NOON states with v = 2 k for 
k = K, K — 1, . . . , 1. This is similar to the optical imple- 
mentation of the QPEA, except at each stage one uses M 
NOON states of each size rather than 1. There is a sub- 
tlety in using the feedback algorithm of Ref. [13] , in that 
when measurements have been performed with NOON 
states down to size v = 2 k , the phase is only known 
modulo 2~n j2 k . To address this, rather than maximizing 

|(e l ^)|, the quantity that is maximized is |(e l2 ^}|. 

Using the same adaptive scheme as before, numerical 
testing indicates that the phase variances scale as the 
Heisenberg limit for M > 3. For small values of M the 
measurements are a poor approximation of the canonical 
measurements; for M = 3 the variance scales as TV -3 / 2 , 
and for M = 2 the variance scales as A -1 . 

The feedback can be achieved in an efficient way when 
performing measurements on these states. Because the 
individual NOON states are not entangled with each 
other, measurements on one do not affect the states of 
the others, except for the normalization when considering 
the unnormalized state. Therefore, it is only necessary to 
keep track of the evolution of the normalization. Recall 
that the normalization gives the probability distribution 
for that sequence of detection results, and is proportional 
to the probability distribution for the phase. 

At each stage the probability distribution for the to- 
tal measurement results is obtained by multiplying the 
probability for the current measurement result by the 



preceding probability distribution 

P(u m \(f)) = P{u m \c/>)P{u m -i\(t>)- 



(5.1) 



The probability distribution for the current measurement 
result, with a NOON state with v = 



is 



P{u m \4>) = i{l + (-l)"cos[2 fc ( 



<£ m )]}. (5.2) 



Provided the measurements are made on the largest 
NOON states first, the overall probability distribution 
can only contain different powers of e l2 ^ . The proba- 
bility distribution may therefore be stored using just the 
coefficients of the different powers. That is, the in 
the expansion 



P(Ur 



(u m )e 



(5.3) 



are stored. 

The coefficients are updated as 



pf\u m ) = 7: { p) K '(u m -i) + 



1 



Mi 



P)-l(Ur< 



' Pj+1 («f. 



}. (5.4) 



When examining measurements for the next lower value 
of v : given by v = 2 fc_1 , the coefficients are expanded out 
by a factor of two. That is, the probability distribution 
is written as 



P(u m \ct>) 



E(fc- 
Pj 

j 



(u m )e 



(5.5) 



where p£j ("™) = (^m) and p^j+i (4) = 0. 

In addition, it is only necessary to keep track of a lim- 
ited number of the Pj(u m ). At the end of the mea- 
surements, the quantity of importance is p\ (um(k+i))- 
This can depend only on the Pj (u m ) for j in the 
range — 2M, . . . , 2M, and only these coefficients need be 
recorded. In fact, only coefficients for j = 0, . . . , 2M need 
be recorded, because those for negative j are the complex 
conjugate of those for positive j. 

To illustrate these ideas, consider the first measure- 
ment for v — 2 k , where the first few coefficients are 



M 



(fe-i). 



Po K 



) 



(AH 

Po 



(i), 



p[ k \u m )=0, 

(fc)/-. \ (AH 
P 2 Kn) =Pl 

pi (4) = o, 

where m = M(K — k). After the next detection, the first 
few coefficients are changed to 

(u m +i) =Pb 



(5.6) 



p^\u m+ i) = p ( k+1) (u m )/2, 



pf\u m+1 ) = (-lY^[p^>{u m )e 
pf\u m+ i) = p[ k +1 \u m )/2. 



„(fc+i) 



i2 K <S>„ 



(5.7) 
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The feedback phase is chosen to maximize the average 
of \p^\u m +i)\ over the two values of u rn+ i. For both 
measurement results, 



\p[ k \u m+1 )\ = ^\pl K ^ L) {u m )e 

(5.8) 

Because p (u m ) is real and positive (the probability dis- 
tribution is real and positive), the feedback phase that 
maximizes this average is 

<W = -2-( fc+1 > ^g[pf +1 \u m )\. (5.9) 
We now investigate this solution for various values of M. 



1 



B. Analytic result for M = 1 



In the case M = 1, Eq. (|5.9|) yields a recurrence rela- 
tion that can be solved, yielding [5^ 



P '(UK-k+l) 
Pi (U/f-fc+l) 



X CXp — i7T 



. 2^-^-1 

2 Jf-fe+i _ x 
" 2 2 (^-fe+i) 

K-fe+l \ 

£ Ui /2 K - fc+1 -' 



(5.10) 



The feedback phase obtained is 

$tf_ fe+ l = 27T ^l/2 K+1 - 1 
1=1 

= 2tt x 0.00 . . . QuK-kUK-k-i ■■■Ui, (5.11) 

where the expression following the "x" on the second line 
is a binary expansion. For each measurement result, ui, 
the feedback phase is adjusted by the appropriate frac- 
tion of 2ir. This feedback is identical to that obtained for 
the linear optics implementation of the QPEA described 
in Sec. IIII1 In addition, summing over [p£ (uK-k+i)\ a t 
the end of the measurement gives [i = 1 — 2~ i - K+1 \ This 
yields exactly the same variance as in Eq. 



C. Analytic result for M = 2 

It is also possible to analytically determine the variance 
for M = 2. In this case, the values of po and p\ after the 
second measurement for a given fc are 

pf Kn +2 ) = pf Kn)/4 + b^l RefrW ( ^ m ) 



Pi 



(k) 



+»-*™+0 + p( fc) (u m ) e i;2fc (*'"+ 2 +*'"+ 1 )], 
(« m+2 ) = ( ~ i r +1 [Po fc) (^)e^ 2 ^'" +1 +P 2 ' £) (« m ) 



x e 



;2' 



■^xi , (~ 1 )"'" +2 r^) 



+ J 4 fc) (a ro )e i2 **^] 



(5.12) 



Taking the absolute value of p\ yields 



lie) i -. 
Pi {u 



1 



m+2 ) 



,(fc) 



(*m+2-*m+l) 



I+ 2-« m + l^2 fc - 1 (<E. I1 



i 2 fc (*,„ +2 +* m+ i: 



(5.13) 



Regardless of whether the feedback phase (|5.9|) is as- 
sumed for (f> m+ i, or both & m +i and <f ) r „+ 2 are maximized 
over, the solution is 

2 fc ($ m+2 - $ m+1 ) = tt/2, 

2 fc ($ m+2 + $ m+1 ) = - arg[p^ ) (u m )] + n/2. (5.14) 

This means that the feedback scheme minimizes the vari- 
ance two detections in advance, rather than just one. 
Independent of the detection results, the absolute value 
obtained is 



\p[ k \u m+ 2)\ = ^vbr(^)] 2 + ipr&mW- (5.i5) 



V2 



Using this feedback also gives 



(5.16) 



This again yields a recurrence relation that can be 
solved. The result is 



■n^lit \_ 2{k-K-l) 

Pq \ u m+2) — £ 



\p[ h) (u m+2 )\ 



_ 9 2(fc-K-l 



Vl-2 fe - 



K-l 



(5.17) 



At the end of the measurement, summing over the 
\Pi ] {u2(K+i))\ gives n = Vl - 2~( K+1 l This yields the 



2(K+1 

Holevo variance in this case as 

1 



Vh = 



2 K+1 _ l 



2 

N' 



(5.18) 



Hence, in this case, the Holevo phase variance is exactly 
2/N. This is almost identical to the result for M = 1, 
where the variance is approximately 2/N. This means 
that, despite the intrinsic state having relatively good 
phase properties (with variance scaling as (\nN)/N 2 ), 
the adaptive measurements are far less accurate than the 
canonical measurement. Furthermore, because the feed- 
back phases minimize the variance two detections in ad- 
vance, it seems unlikely that a better feedback scheme 
would yield significantly more accurate measurements. 



D. Numerical results for M > 2 

For larger values of M, more accurate measurements 
are obtained, but we have not found exact analytical re- 
sults. It is possible to obtain exact results for smaller 
values of K simply by systematically calculating the fi- 
nal values of pf^ for every possible combination of mea- 
surement results. For larger values of K this approach is 
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FIG. 2: The Holevo phase variance multiplied by N 2 for M = 
1 to 6. The results for M = 1 are shown as the solid line, for 
M = 2 as the diamonds, for M = 3 as the circles, for M = 4 
as the pluses, for M = 5 as the crosses, and for M = 6 as the 
asterisks. The Heisenberg limit is also shown as the dashed 
line for comparison. 



no longer feasible, but instead the variance can be esti- 
mated by generating random system phases, then gener- 
ating measurement results according to their probability 
of occurring. There arc then two alternative ways of esti- 
mating the phase variance. One is to use the mean of the 
final values of | as an estimate of [i,. The other is to 
use the variance in the final phase estimates relative to 
the system phase. Numerically it is found that using the 
values obtained for \pf' | gives a more accurate estimate 
of the phase variance. 

The results for M = 1 to 6 are shown in Fig. [5] The 
variances are multiplied by N 2 , so that the Heisenberg 
limit appears as a horizontal line. The lines for M = 1 
and 2 are not scaling as the Heisenberg limit, and are 
indistinguishable on this plot for N above about 10. The 
variances in these cases scale as 1/N, and the numer- 
ically obtained variances agree with the analytical pre- 
dictions to within the numerical precision. The lines for 
M = 4, 5 and 6 are clearly scaling as 1/N 2 . The line for 
M = 3 has intermediate scaling, and is consistent with 
an asymptotic scaling of 1/N 3 / 2 . 

Although the results for M — 4, 5 and 6 scale as the 
Heisenberg limit, they have different scaling constants. 
The variances for these cases, multiplied by N 2 , are 
shown on a linear scale in Fig. [3] The smallest scaling 
constant of about 23 is obtained for M = 5. 



VI. THE EFFECT OF INCREASING THE 
NUMBER OF REPETITIONS 

In order to obtain higher precision, there are two pa- 
rameters that can be increased. The value of K can be 



FIG. 3: The Holevo phase variance multiplied by JV 2 for M = 
4, 5 and 6. The results for M — 4 are shown as the pluses, 
for M = 5 as the crosses, and for M = 6 as the asterisks. 
The Heisenberg limit is also shown as the dashed line for 
comparison. 



x 10" 3 




500 1000 1500 2000 2500 3000 



FIG. 4: The squares of the state coefficients for the equivalent 
two-mode state for K = 4 and M — 100. 



increased, or the number of repetitions, M, can be in- 
creased. Scaling at the Heisenberg limit can only be ob- 
tained by increasing K, not by increasing M, because 
the state coefficients do not gradually increase from zero 
in the latter case. Instead they are negligible for most 
values of n, and suddenly rise to significant values near 
the centre (for example, see Fig.@|. 

The result can be shown rigorously using the uncer- 
tainty relation (in terms of the variance) Vh(^)V (n) > 
1/4 [31(, where V(n) is the variance for the photon num- 
ber. To determine the number variance, take PM( n ) = 
fAi(n)/(NK + 1), and use the recurrence relation (see 
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Appendix [X} 



/Af(n) = ^2 fM-i(n - k). 



(6.1) 



fc=0 



This yields the recurrence relation ( 
N K /2 (where (n) M = X) n npAf(«))- 
MNk/2. The same approach for (n 2 
rence relation 



n)M = {n) M -i + 
This gives (ti)m = 
} m gives the recur- 



{n )m 



(n 2 ) M - 



+ N K (2N K + l)/6+N K (n) M -i- (6.2) 



M 2 N K /4, 



Solving this gives (n 2 ) M = MN K (N K +2)/12 
so V(n) = MN K (N K + 2)/12. Using the uncertainty 
relation, we obtain the lower bound on the phase variance 



MN K (N K + 2) 



(6.3) 



Thus, with fixed K, the variance can only scale as 1/M, 
not as 1/M 2 . On the other hand, because the number 
variance scales as the square of Nk, the Heisenberg limit 
can be achieved when increasing K, as we have already 
seen. 

The predicted phase variance obtained using the adap- 
tive measurement scheme with K = 4 is plotted as a 
function of M in Fig. [5] Experimental results using mul- 
tiple passes are also shown in this figure, and agree with 
the theoretical predictions. These experimental results 
were obtained using an apparatus and methodology iden- 
tical to that described in Ref. [25|. When increasing M, 
the variance at first decreases rapidly, and approaches 
the Heisenberg limit. Then, as M is increased further, 
the variance no longer decreases at the same rate as the 
Heisenberg limit, and instead scales as the SQL for single 
passes. 



VII. SIMPLIFICATIONS OF THE ADAPTIVE 
MEASUREMENTS 

The adaptive scheme described above is complicated 
to implement in practice due to the calculation required 
to determine the feedback phase. This calculation is non- 
Markovian, in the sense that it depends in a nontrivial 
way on all the prior measurement results. For this reason 
it is useful to develop schemes using simplified (Marko- 
vian) feedback, or no feedback. Two alternative schemes 
were proposed and experimentally demonstrated in Ref. 
[2t1 |. Here we provide further motivation and theoretical 
analysis of these alternative measurement schemes. 



A. Hybrid scheme 

One scheme is based on a simple modification of the 
QPEA. The QPEA is Markovian, because the increment 
in the feedback phase depends only on the immediately 
preceding result. As discussed above, the QPEA gives 
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FIG. 5: The theoretical predictions and experimental results 
for adaptive measurements with K = 4 and a range of values 
of M (given by M — N/31). The solid line is the predictions 
for the adaptive scheme, the dashdotted line is the intrinsic 
variance, and the crosses and error bars are for the experimen- 
tal data. The SQL for single passes is shown as the dotted 
line, and the dashed line is the Heisenberg limit. 



a canonical measurement, but the state is equivalent to 
an equal superposition, and thus has poor phase prop- 
erties. It yields a probability distribution for the phase 
with tails that drop off slowly, and give the dominant con- 
tribution to the phase variance. The idea behind hybrid 
measurements is to use additional "standard" measure- 
ments (single passes of single photons) to reduce the size 
of the tails of the distribution. The hybrid measurements 
are still Markovian, because they do not use any feed- 
back beyond what is used in the QPEA. However, these 
measurements do not scale at the Heisenberg limit, and 
instead the variance scales at best as close to 1/N 3 / 2 . In 
particular, the result is given by the following theorem. 

Theorem 2. The canonical phase variance of the state 



Ml 



K 



/N K 



vn=0 

<g)M 



(7.1) 



® (|0,1) +e*\l,Q)y 
scales as 0(iV" 3 / 2 ), where N = N K + M. 

Here VL is the standard notation for a lower bound on 
the scaling. In applying this theorem, we take Nk = 
2 K+1 — 1 to be the number of photons used in the QPEA, 
and M to be the number of photons used in the "stan- 
dard" interferomctry. 

Proof. To obtain the equivalent two-mode state for M 
repetitions of measurements with single photons we apply 
the recurrence relation (|6.ip . Repeatedly applying this 
recurrence relation gives 



/m(«) 



(7.2) 
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This result can also be obtained from the fact that this 
number is the number of ways of choosing n ones from M 
bits. Then applying the recurrence relation again with 
the flat distribution gives 



/(") 



N K 

E 

k=0 



M 

n — k 



This gives the probability distribution for the n, 

N K 



p(n) 



1 



2 M (N K + 1) 



k=0 



M 

n — k 



(7.3) 



(7.4) 



Evaluating the moments for this probability distribution 



gives 



(n) = {N K + M)/2, 

{n ^» {M + 1) + ^i + ^( 1 + 2N K y (7.5) 

This gives the variance in n as (3M + 2N K + N K )/12. 
As above, using the uncertainty relation for the Holcvo 
variance yields 



Vk(0) > 



3M + 2N K 



(7.6) 



This relation implies that if N K G 0(iV 1/2 ), then the 
SQL is obtained. To obtain better scaling than the SQL, 
it is necessary for Nk to increase with N more rapidly 
than s/N. 

This result on its own does not not show that the in- 
trinsic variance cannot scale better than iV -3 / 2 , and it is 
necessary to also examine other features of the equivalent 
two-mode state. The problem with the equivalent two- 
mode state is that the state coefficients rapidly increase, 
then there is a wide flat region (for example, see Fig. [BJ, 
rather than the gradual increase to a maximum which is 
required. 

To put a bound on the phase variance, consider the 
value of p(n) for n = n_ = [M/2 — \^M] and n = 
n + = [M/2 + vMJ. These values are chosen as points 
between which the p(n) vary rapidly (see Fig. [6J. Be- 
cause M < N, and Nk increases more rapidly than s/N, 
asymptotically we must have Nk > 2yM. Using this, 
and Hoeffding's inequality we obtain 



1 



2 M {N K + 1) 



E 



> 



l y 



k—n+—NK 
n+ 



> (1 - 2e- 2 )/(N K + 1) 



(7.7) 



and 



p(n-) 



2 M (N 



1 



k=n_-N h 



< e-' 2 /(N K + 1) 



(7.8) 
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FIG. 6: The squares of the state coefficients for the equivalent 
two-mode state for hybrid measurements with Nk = 127 and 
M — 64. The pluses show the positions of n_ and n+. 



In addition, it is easily verified that n + — n- 
so 



n+ — 1 



Wp{n + 1)- Vpjnj) 2 > 



(Vl-2e- 2 - e - 1 ) 2 
4{N K + l)^AI/3 

= , (7.9) 



> 



2k(N k + 1)VM 

where k w 4.9. The points n_ and n+ are those between 
which the state coefficients rapidly increase. There is also 
a second region for large n where the state coefficients 
rapidly decrease. The symmetry of the distribution im- 
plies that the sum over that region has the same lower 
bound. Using Eq. (|4.8|) . and the fact that Vh > Vc, the 
variance for sufficiently large N is lower bounded by 



1 



k{N k + l)VM 



(7.10) 



The smallest possible value for this lower bound is ob- 
tained for both Nk and M of order N, which gives 



Vn($) = Sl(N- 3 / 2 ) 



(7.11) 



□ 



In Ref. [27l | it was proven that measurements could 
yield a variance scaling as 0(\/ln NN~ 3 / 2 ). The lower- 
bound scaling here is the same, except for a small Vhi N 
factor. The method used for the proof in [27| was to con- 
sider an analysis of the data where, if the phase estimates 
from the single-photon measurements and the QPEA dif- 
fered by too large an amount, the phase estimate from 
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the single-photon measurements was used, and otherwise 
the phase estimate from the QPEA was used. 

The general idea behind the proof of [27| is that 
the phase estimate from single-photon measurements 
has a negligible probability of having error larger than 
0(l/vM). Therefore, if the phase estimates from the 
single-photon measurements and the QPEA differ by 
more than 0(l/y/M), it is almost certainly the QPEA 
phase estimate that is wrong, so the phase estimate from 
the single-photon measurements should be used. The 
variance then is of order 1/M, but because it occurs with 
probability \[M /Nk, the total contribution to the vari- 
ance is 0(1/ \NkVM)). If the phase estimates do agree 
to within 0(1/M), then the contribution to the variance 
is again of order 1/(Nk\/~M)- 

The additional factor of vlnJV comes about because 
it is not quite true that the probability of the single- 
photon measurements having error larger than 0(1/ s/M) 
can be ignored. This probability must scale down with 
M to prevent it giving a contribution to the variance 
larger than 0(1/(NkVM))- To ensure that this prob- 
ability is sufficiently small, the size of the error should 
be 0(y/(]nM)/M). This leads to the additional VlnN 
factor in the final result. 

The analytical results show that there is a lower bound 
on the variance for hybrid measurements of S!(iV~ 3 / 2 ), 
and an upper bound of 0(VhiAW- 3 / 2 ). The scaling 
must be close to N~ 3 / 2 , but these bounds leave open the 
question of whether there is an additional logarithmic 
factor. The hybrid measurements have been simulated 
for values of K up to 8, and the results are shown in Fig. 
[T] The variances shown are multiplied by N 3 ^ 2 to make 
the scaling clearer. For the results shown the value of 
VrN 3 / 2 increases with N, suggesting that the variance 
is slightly larger than 0(N~ 3 / 2 ). 

In Fig.[jj results are shown for increments in $ of ir/M 
and 7r/2. The variance for the single-photon measure- 
ments on their own is slightly smaller with the n/M in- 
crements than with the 7r/2 increments. However, for 
the hybrid measurements, using increments of tt/2 yields 
significantly better results. Results are also shown for 
M = 2 K , as well as for where the value of M has been 
adjusted to minimize VrN 3 / 2 . The motivation for us- 
ing M = 2 K is that the analytical results suggest that 
M ~ N/i is optimal. Numerically it was found that 
slightly higher values of M gave slightly better results, 
particularly for $ increments of n/M. 



B. Nonadaptive scheme 

It might be expected that approaching the Heisenberg 
limit requires an adaptive measurement. Normally the 
variance would be approximately the sum of the canoni- 
cal phase variance and an additional variance due to the 
measurement technique [35j . and nonadaptive measure- 
ments would introduce a variance scaling as the SQL [38j . 
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FIG. 7: The variance for hybrid measurements. The solid line 
is the canonical phase variance for the equivalent two-mode 
state. The crosses and pluses are for M = 2 K , and increments 
in $ of 7r /M and 7r/2, respectively. The diamonds and circles 
are for values of M that give the minimum value of VnN 2 , 
and increments in $ of tt/M and it/2, respectively. 



Despite this, it is still possible to achieve the same scal- 
ing as the Heisenberg limit with a nonadaptive technique 
27l | . The reason why this is possible is that the non- 
adaptive measurement with multiple time modes has a 
different POVM from the nonadaptive measurement on 
a single time mode. 

A new feature of the nonadaptive technique of [27[ is 
that the number of repetitions, M, is now a function of 
the values of K and k. The largest value of M is for 
a single-photon NOON state, and the number of rep- 
etitions is decreased as the size of the NOON state (or 
number of passes in Ref. [27]) is increased. The reason for 
changing the number of repetitions is essentially that the 
role of the measurements for smaller values of k is to dis- 
tinguish between the multiple phase estimates provided 
by larger values of A;. If the measurements for smaller val- 
ues of k do not distinguish between these phase estimates 
accurately, then the resulting error is large. On the other 
hand, because the resource cost for the smaller values of 
k is small, the cost of repeating these measurements is 
low. It is therefore better to repeat these measurements 
more often, in order to prevent these large errors. 

If the number of repetitions is not changed with k, 
then the numerical results indicate that the scaling of 
the variance is not as good as 1/N 2 . The numerical re- 
sults are shown in Fig. [5] The value of VnN 2 increases 
approximately linearly in IniV, indicating that the seal- 
is as (In N)/N 2 . For these results the value of M 



mg 

was increased with K, even though it was independent 
of k. It was found that the best value of M increased 
nearly linearly in K, from about 10 for K = 2 to 24 
for K = 9. The primary contribution to the variance 
was from low-probability results with large error. This 
meant that extremely large numbers of samples need to 
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FIG. 8: The variance for nonadaptive measurements with M 
optimized as a function of K, but independent of k. The 
crosses are for 2 20 samples and the pluses are for 2 25 samples. 



be used to obtain accurate estimates of the variance. Cal- 
culations with 2 20 samples yielded lower estimates of the 
variance than those for 2 25 samples, because they did not 
sufficiently sample the low probability results. 

The reason why M needs to be increased with K in 
the nonadaptive case can be understood in the following 
way. For simplicity, consider just the measurements with 
k = (a single-photon NOON state), and assume that 
the measurements with larger values of k are sufficient to 
narrow down the estimates of the phase to <fi and <f> + it. 
The measurements with k > measure the phase mod- 
ulo 7r, so they cannot distinguish between these alterna- 
tives. They do not give the phase exactly modulo tt, but 
assuming that they do can only decrease the phase vari- 
ance. The measurements with k = could distinguish 
between the two alternatives if they were known. 

In the adaptive case, the phase is known accurately 
(though not perfectly) modulo tt at the stage the mea- 
surements with k = are performed. The feedback phase 
$ can therefore be chosen so as to efficiently distinguish 
between the two alternatives. However, in the nonadap- 
tive case, the values of $ used are completely indepen- 
dent of the actual phase. In the case where there is a 
fixed value of M , there is a nonzero probability that the 
measurement results will be more consistent with <j) + tt 
rather than <fi, so there is a tt error in the phase estimate. 

In particular, the probability that all the measurement 
results with k = are consistent with <j) + tt rather than 
is 

M 

terror = ]J ^(1 - | COs(0 - $ m )|). (7.12) 
ra—1 

There will be other combinations of measurement results 
that are also more consistent with <fi + 7r, but the prob- 
ability for this combination on its own provides a lower 



FIG. 9: The variance for nonadaptive measurements with M 
a function of both K and k. The results for M(K, k) — 2 + 
3(K — k) are shown as diamonds, the results for M(K,k) = 
4 + 2(K — k) axe shown as both the pluses and crosses. The 
crosses are for 2 20 samples and the pluses are for 2 25 samples. 
The circles are M(K, k) = 1 + K — k for K = 1 and M(K, k) = 
1 + 4(A' - fc) for K = 2. 

bound on the overall probability that the final phase es- 
timate is 4> + tt. For some particular values of the system 
phase this probability will be zero, but averaging over all 
system phases gives a nonzero error probability. 

This lower bound on the probability of a measurement 
result that gives a tt error in the phase estimate means 
that if nonadaptive measurements are performed with a 
fixed value of M, the phase variance can not decrease 
below some minimum value. More generally, if the value 
of M is allowed to depend on K (but still be independent 
of k), then there need not be a fixed lower bound on 
the variance, but because of Eq. (|6.3[) the variance can 
scale no better than M(N)/N 2 . As M must increase 
with K (and therefore TV) in order to remove the lower 
bound on the variance, the variance does not scale at the 
Hciscnbcrg limit. 

For the nonadaptive scheme given in Ref. (27j the vari- 
ation of M used was M (K, k) = 2 + 3(K - k). Nu- 
merically it was found that this variation of M gave the 
smallest asymptotic variance that could be reliably cal- 
culated. The value of V-rN 2 was approximately 40.5 for 
values of N up to more than 10 7 (see Fig. [9]). This cor- 
responds to an uncertainty approximately 2.03 times the 
Hciscnbcrg limit. 

Alternative values of M given by M(K, k) = 4 + 
2(K — k) did give slightly smaller values of V^N 2 for 
large N. However, calculations in this case were not re- 
liable because the variance depends heavily on results 
with large error and small probability. Using 2 20 sam- 
ples gave significantly smaller values of V^N 2 in most 
cases, except for one value which was much larger. When 
the number of samples was increased to 2 25 the value 
of VrN increased to approximately the same as for 
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M(K, k) = 2 + 3(K - k) for A up to about 10 5 . For 
larger values of A the variance was still smaller, but it 
is likely that the number of samples is still too small to 
obtain a reliable result. 

Using other functions for M(K, k) did give slightly bet- 
ter results for small values of K . For K = 1, the func- 
tional dependence that gave the smallest value of VhA 2 
was M (K, k) = l + K- k. For K = 2, the best values of 
M were given by M(K, k) = 1 + 4(A" - k). The results 
for these cases are also shown in Fig. [5] 



VIII. ADAPTING THE SIZE OF THE NOON 
STATE 

Up to this point we have considered a fixed sequence 
of NOON states, and the only parameter which was var- 
ied adaptively was the feedback phase. It is also possible 
to consider adjusting the size of the NOON states used 
according to the measurement results. The most obvi- 
ous way of doing this is to select the NOON state that 
minimizes the expected variance after the next detection. 
The problem with this approach is that it will select large 
NOON states that give small variance, but result in a 
greater use of resources. As the aim is to obtain the 
smallest possible scaling constant for V^N 2 , a better ap- 
proach is to choose the NOON state that minimizes this 
quantity. 

To determine results for a specific value of A, the sizes 
of the NOON states were restricted such that the total 
resources did not exceed A. The resulting variances ob- 
tained arc shown as the crossed circles in Fig. [TO] For the 
larger values of A (2 11 = 2048 and above), it was only 
feasible to use 2 10 samples, so these results have low ac- 
curacy. Nevertheless it is clear that using this approach 
yields a significantly higher variance than simply using 
adaptive measurements with a fixed sequence of NOON 
states. In addition, the scaling of the variance appears 
to have increased to (In A) /A 2 . 

It is possible to improve on this adaptive scheme by 
using a radically different approach [28[ . There are three 
main features of this approach: 

1. The phase estimation begins with 100 measure- 
ments with single photons and no feedback (similar 
to the hybrid technique introduced in Sec. I VII Aj) . 

2. After the first step, the feedback phase is chosen 
such that the expected probabilities for the two 
measurement results are equal. 

3. After the first step, the size of the NOON state 
is chosen to minimize the expected entropy of the 
phase distribution after the next measurement. 

More specifically, the size of the NOON states is chosen to 
minimize (S^/ln A, where iV is the number of resources 
after the next detection. The motivation for dividing 
by In A can be understood in the following way. For a 
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FIG. 10: The variance for measurements where the size of 
the NOON states are adapted. The crossed circles are where 
the size is chosen to minimize the expected value of VhA 2 
after the next detection, and the sizes are chosen such that 
a particular value of N is obtained. The continuous lines are 
where all intermediate values of N are sampled. The dotted 
green line is where Vh-A 2 is minimized, the solid red line is 
where {{S) — ln(27r)]/ln N is minimized, and the dashed blue 
line is where VrN 2 is minimized, but the feedback of [2S| is 
used for the phase (color online). 



probability distribution, the negative of the entropy ap- 
proximately corresponds to the information that may be 
communicated using a distribution of that form. As the 
distribution narrows by a factor of two, the phase is ef- 
fectively known to an additional bit, and the entropy de- 
creases by 1 . Therefore the entropy is approximately pro- 
portional to the log of the phase uncertainty. The phase 
uncertainty varies as a power of A, so the entropy should 
be approximately proportional to the log of A. The best 
phase information for a given amount of resources can 
therefore be obtained by minimizing (S) /In A. 

The calculations given in Ref. [28[ also used some addi- 
tional simplifications to increase the speed. Rather than 
searching all possible values of the size of the next NOON 
state (which could be many thousands), only about 30 
were searched. Denoting the number of resources used 
so far by A, only sizes up to [A/3] were searched, and 
in step sizes of [A/100]. In addition, each run was used 
to obtain phase estimates for each intermediate number 
of resources. There is a complication in doing this, be- 
cause many of the NOON states have v > 1, so there 
are not phase estimates for all values of A. To obtain 
phase estimates for all A, the estimates for these skipped 
values were approximated by taking the phase estimate 
from the next lower value of A that was not skipped. 
This gives samples for all intermediate values of A, and 
cannot underestimate the variance. Even with these sim- 
plifications, the phase variance scaled approximately as 
40/ A 2 . 

On further investigation, we have found that the feed- 
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back scheme of Ref. [28| can be further simplified, and 
provide more accurate phase estimates. One simplifica- 
tion is in the approximation used for the entropy. It is not 
practical to calculate the entropy exactly, but it can be 
accurately approximated from the moments of the dis- 
tribution. Ref. [HI used a higher-order approximation 
involving the third- and fourth-order moments (the ap- 
proximation given in Eq. (10) of [281 ] is for the negative of 
the entropy). The calculation can be simplified by using 
the simpler approximation 

5« i[l + ln(27r)+lnVb], (8.1) 

Numerically it is found that this simpler approximation 
gives accuracy of the phase measurements similar to that 
using the higher-order approximation. 

It was also found that the variance could be reduced 
if the quantity that was minimized was of the form 
[(S) + C]/lnN, where C is some constant. One value 
that gave particularly good results was C = — ln(27r) 
(which just corresponds to changing the sign on ln(27r) 
in Eq. before dividing by In AT). In Ref. Q the 

measurement scheme was started with 100 single-photon 
measurements, in order to obtain a narrowly peaked dis- 
tribution. The drawback to this approach is that it in- 
creases the variance for total photon numbers of a similar 
order. The majority of the resources need to be used in 
NOON states with high photon numbers in order to ob- 
tain results near the Heisenberg limit. To reduce this 
problem, the number of single-photon measurements was 
reduced to 10. 

In Fig. [TO] the results from three different approaches 
are shown. The first scheme described in this section, 
except with the sampling scheme of Ref. [28| , is shown as 
the dotted line. This sampling appears to give slightly 
worse results in this case as compared to the results where 
a fixed maximum value was used. For larger values of 
N, VrN 2 is typically above 60. The results where [(£) — 
ln(27r)]/ In TV was minimized are also shown in this figure. 
This seemed to give the best results out of the alternative 
variations of the scheme of Ref. [H| that were tested, 
with variances significantly below those where VrN was 
minimized. 

The third alternative that is shown in Fig. [TO] is that 
where VrN 2 is minimized when choosing the size of the 
NOON state, but the feedback phase is chosen to en- 
sure that the probabilities of the two detection results 
are equal (as in Ref. (HI). This can be achieved sim- 
ply by replacing [(S) + C]/\nN with (S) + 2 In N. This 
scheme also gives better results than the first scheme, and 
gives variances close to those for the scheme minimizing 
[{S) -1h(2tt)]/1iiJV. 

Even though the schemes where the size of the NOON 
states are adapted are far more flexible than the schemes 
with a fixed sequence of NOON states, they do not ap- 
pear to give any better results. For most methods tested 
the variance obtained is far greater than that for a fixed 
sequence of NOON states. Even for the best method 



tested, the variances obtained were simply comparable 
to those for the fixed sequence of NOON states, and did 
not give any improvement. Because the fixed sequence 
of NOON states is far simpler to implement, it would be 
preferable to use for most applications. 



IX. CONCLUSIONS 

NOON states achieve the highest phase resolution pos- 
sible for a given photon number, but do not provide an 
unambiguous phase estimate. By combining measure- 
ments with a range of NOON states with different photon 
numbers, it is possible to achieve a phase resolution at 
the Heisenberg limit, and eliminate the ambiguity in the 
phase estimate. As well as being applicable to NOON 
states, these schemes can also be performed using sin- 
gle photons, by passing them through the phase shift 
multiple times, as experimentally demonstrated in Rcfs. 

[Mill- 
In this paper we have considered many different mea- 
surement schemes involving sequences of NOON states, 
some of which give a variance scaling at the Heisenberg 
limit, and some of which do not. Below, we discuss these 
results, but for the benefit of the reader we present a 
summary of all the key results in this paper in Table [I] 

Using a sequence of NOON states with photon num- 
bers that are decreasing powers of two, an elegant adap- 
tive optical phase measurement can be performed by 
implementing the quantum phase estimation algorithm 
from quantum computing theory 0, H[ . This provides a 
canonical measurement of the phase, which is a remark- 
able result because normally it is impossible to perform 
a canonical phase measurement using linear optics. The 
canonical measurement is possible because the photons 
are separated into distinct time modes. 

This approach does not provide variance scaling as 
the Heisenberg limit, because the underlying state has 
poor phase properties, and gives a significant probability 
for large phase errors. Reduced phase variance could be 
achieved using states that are entangled between the dif- 
ferent time modes. A more practical alternative is to use 
multiple copies of the state, which also provides reduced 
canonical phase variance. The canonical measurements 
can no longer be performed, but for 4 or more copies it is 
possible to accurately approximate the canonical phase 
measurement using adaptive measurements. The mini- 
mum variance is then obtained using 5 copies. 

Perhaps surprisingly, it is also possible to achieve the 
Heisenberg limit using nonadaptive measurements. If the 
same number of copies of each size of N OON state is used, 
then the phase uncertainty scales as vmJV /N. In order 
to achieve the Heisenberg limit, the number of copies 
must be a function of the size of the NOON state. In 
particular, the number of copies must be of order unity 
(e.g. 2) for the largest NOON state, and should increase 
by a constant number (e.g. 3) at each step as one halves 
the size of the NOON state. 
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Scheme 


NOON Sequence 


Intrinsic Var. 


Reference 


Estimate Var. 


Rating 


Reference 


single photons 


1,1,"- ,1 


©(1/TV) 




G(1/TV) 


* 




QPEA: M = 1 


2 K ,2 K -\--- ,1 


©(1/TV) 


[25]; Sec. IV 


©(1/TV) 


* 


[25J; Sec. V 


generalized QPEA M = 2 


2 K twice, • ■ • , 1 twice 


©(InTV/TV 2 ) 


Sec. IV 


©(1/TV) 


* 


[25J; Sec. V 


generalized QPEA M = 3 


2 K thrice, • ■ • , 1 thrice 


©(1/TV 2 ) 


Sec. IV 


e*(i/jv 3 / 8 ) 




Sec. V 


generalized QPEA M > 4 


2 K M-fold, • • • , 1 M-fold 


©(1/TV 2 ) 


Sec. IV 


e*(i/A 2 ) 


***** 


[25J; Sec. V 


generalized QPEA K fixed 


2 K M-fold, • ■ • , 1 M-fold 


e(2- 2A '/7V) 


Sec. VI 


0*(2- 2A '/Af) 


** 


Sec. VI 


nonadaptive, best M(K) 


2 K ,--- ,1, each M(A')-fold 


Q(M(K)/N 2 ) 


Sec. VII 


©*(ln7V/TV 2 ) 


**** 


Sec. VII 


nonadaptive, best M(K, k) 


{2 fc }, each M(K, k)-Md 


e(i/A 2 ) 


[27]; Sec. VII 


©(1/TV 2 ) 




[27]; Sec. VII 


hybrid QPEA+singles 


2 K ,2 K ~\--- ,1,1,... ,1 


fi(l/TV 3/2 ) 


Sec. VII 


0(VlnTV/TV a/2 ) 


*** 


[27]; Sec. VII 


non-binary adaptive 


adaptively chosen 


N/A 




<T(1/TV 2 ) 


**** 


[28]; Sec. VIII 



TABLE I: Scalings of the phase variance for the various phase estimation schemes considered in this paper. The column 
"NOON Sequence" shows the number of photons in each state in the sequence, in temporal order (which is important for 
adaptive measurements). In all but the last scheme, these photon numbers are powers of 2 between 1 (a single-photon state) 
and 2 K . Two variances are shown: the intrinsic variance (i.e., for a canonical phase measurement), and the variance for 
the actual measurement scheme. The former is undefined for the final scheme because the sequence of states there is not 
predetermined. Rather than exact variances, only the scaling of the variance with TV, the total number of photons, is shown. 
The O, O and Q. are standard Bachmann-Landau notation; O means the asymptotic scaling, while O and $1 mean an upper 
and lower bound, respectively, on the asymptotic scaling. The symbol O indicates that multiplying factors scaling as powers 
of log (TV) are ignored. We use an asterisk, such as in O*, to indicate that the scaling is only proven numerically. For each 
result we give the section where the result is discussed, as well as a citation to previous work (if it exists). Finally, the 
"Rating" column gives an easy guide to the performance of the various phase estimation schemes. The stars have the following 
interpretation: * — variance equal to, or a constant multiplier greater than, the SQL of 1/TV asymptotically; ** — variance 
scaling as the SQL, but smaller by a constant multiplier; *** — variance scaling roughly as the geometric mean of the SQL, 
and the Heisenberg-limit (HL) of n 2 /TV 2 asymptotically; **** — variance scaling almost as well as the HL; ***** — variance 
equal to, or a constant multiplier times greater than, the HL. 



An alternative to fixing the number of copies of a given 
NOON state, is to consider schemes which choose the size 
of the next NOON state to be used adaptively. This ap- 
proach can yield a variance close to the Heisenbcrg limit. 
The obvious approach, which is to choose a NOON state 
which minimizes the variance multiplied by TV 2 , provides 
scaling slightly worse than the Heisenberg limit. How- 
ever, it is possible to obtain scaling much closer to the 
Heisenberg limit using an alternative scheme where the 
entropy of the phase distribution is minimized instead. 
This approach was first presented in Ref. [H]. Here we 
have simplified and improved that scheme in a number 
of ways. 

Some other alternatives that might be expected to pro- 
vide phase measurements at the Heisenberg limit do not. 
This can be conveniently proven by examining canon- 
ical measurements for the equivalent two-mode state. 
One such alternative is hybrid measurements, where the 
QPEA is supplemented with single-photon detections, to 
reduce the probability of large phase errors. This ap- 
proach yields a canonical phase uncertainty scaling as 
1/TV 3 / 4 , and the uncertainty in the phase estimate from 
the hybrid measurements scales slightly worse. Another 
alternative that does not yield the Heisenberg limit is 
simply increasing the number of repetitions, rather than 
the maximum size of the NOON state (or, equivalently, 
number of passes). While this gives results which remain 
below the standard quantum limit of 1 / \f~N for phase 
precision, they scale in exactly the same way as the SQL. 



In this paper we have demonstrated this latter re- 
sult experimentally by keeping the maximum number 
of passes fixed while increasing the number of photons 
used. This demonstration helps illustrate why gravita- 
tional wave interferometers do not approach the limit 
of 1/iV for phase precision even though they use multi- 
ple passes: in such interferometers the number of passes 
is fixed at its maximum value (which is typically very 
large), while the number of "repetitions" (the number 
of photons in the coherent state which is used) is also 
made very large. This contrasts with all of the schemes 
that do scale as 1/TV, in which the number of repetitions 
is of order unity for the largest pass-number (or largest 
NOON state), and in which the number of passes (or size 
of the NOON state) must be varied across a large range 
of values from one to its maximum value. 
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APPENDIX A: PROOF OF THEOREM Q] 

Here we give the full proof of Theorem [TJ which gives 
the scaling of the canonical phase variance for repetitions 
of the measurements. In general the /m(^) can be ob- 
tained by the recurrence relation 



/m(«) 



n k 

E 

k=0 



fM-i{n - k). 



(Al) 



Note that this is a discrete convolution of /a/-i with 
the uniform distribution. This recurrence relation comes 
about because the number of combinations of the val- 
ues of Ti\, i%2, ■ . • , n m may be obtained, for each value 
of n\, by determining the number of combinations of 
?i2, . . . , riM that sum to n — n\. The simplest example is 
M = 1, where /i(n) = 1 for < n < N K and /i(n) = 
otherwise. For M = 2, evaluating the sum yields 

7i + l, < n < N K 

[ 2N K -n+1, N K <n< 2N K (A2) 
0, otherwise. 

A crucial point is that the increment between successive 
values of /m is bounded. The recurrence relation implies 

\f M {n + \)-f M {n)\ 

< (N K + 1) max | / M -i (k + 1) - /m-i (k) I • (A3) 

Because \$2{n + 1) — f2(n)\ < 1, this means that 

\fm(n + 1) - fu(n)\ < (N K + l) u ~ 2 . (A4) 

The function is symmetric, so /m(ti) = Jm{N — n). It 
is possible to determine the general value of /m(?i) for 
n < Nk- The solution is 

, , , (n + M-l\ (n + M-l)\ 
fM ^={ M-l )= nl(M-iy. ~ 

To prove this, first note that it is correct for M = 2. 
Next, use the recurrence relation to give 



fM{n) - fAi(n - 1) 



Nu 



= E fM-i(n - k) - y^J M -i{n -k-1) 



k=0 
N K 



k=0 
N K + 1 



^2 fM-i(n - k) - ^2 /m-i (n - k) 



k=0 



k=l 



hi-i{n) - fM-i(n - N K - !)■ 



(A6) 



For n < Nk, the argument of /m(ti — Nk— 1) is negative, 
so this term is zero. If the solution is correct for M — 1 



and for n — 1, then 

/at («) = /m(" - 1) + /m-i(i) 

(n + M-2)! (n + M-2)! 
~ (n-l)!(M-l)! + n!(Jl/-2)! 

M - 2)! 



(n + M - 1)- 
(n + M - 1)! 



n!(M- 1)! 



n!(M-l)! ' 
Using the formula gives, for n < Nk, 

f M {n + l)- f M (n) < (n + 2) M - 2 , 
(n + 2) M ~ 1 



f M (n + l) + f M (n) > 



(M-l)! 



For M > 2, this gives 
[fM(n + l)-f M (n)] 2 



< (M-l)I(JVjf + 1) 



M-3 



(A7) 

(A8) 
(A9) 

(A10) 



[/M(n + 1) + / M (n)] 

Due to symmetry, the same result holds for n > N — Nk- 
This part of the derivation also holds for n = — 1 or N. 
For M = 2 or 1, a decreasing function of n is obtained, so 
the n cannot be replaced with Nk for the upper bound. 

The distribution for /m(ti) cannot have a nontrivial 
minimum (the trivial minimum is zero at the bounds). 
That is, it cannot decrease with n, then increase. This 
can be shown by induction in the following way. If and 
only if /m (n) has a nontrivial minimum, then there exist 
n, n', Sn > and 5n' > such that n' < n, > 
/m(« — Sn) and /m(ti') < /m(«' — Sn'). That would 
imply fu-i(n) > /a/-i(™ - Nk - Sn) and f M -i{n') < 
fu—i (n' — Nk — Sn'). However, that would imply that 
fM-i{n) has a nontrivial minimum. As we know that is 
not the case for M = 1 and 2, using induction shows that 
fuin) cannot have a nontrivial minimum for any M . 

This result implies that fhi{n) increases to a maxi- 
mum at the centre, then decreases. In particular, for 
N K < n < N - Nk, /iVf(n) > fM(N K ). Now we have 
fu{N K ) > (N K + 1) M "7(M - 1)!, so 

[fM(n + l)-f M (n)] 2 



WH)lW»)] <(M-mN K + D 
for Nk < n < N — Nk- Therefore, overall 



N K 

E 



[f M {n+l)-f M {n)f 



< M\(N K + 1) 



M—3 



M-2 



n t± x [fM{n + l)+f M (n)} 
The variance Vc for the state is given by 
2(1-1(^)1) 



(All) 



(A12) 



N , 



< 



< 



(jvx + ir 

1 y> (/M(n + 1)-/ M (n)) 2 
(7V^ + 1)« (/ M (n + 1) + / M (n)) 

M! M!M 2 
(JV X + l) 2 ~ (N + M) 2 ' 



(A13) 
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Hence for M > 2, the variance scales as the Heisenberg 
limit of 1/N 2 . 

In the case M = 2, the bound for n < Nk is 



[fM(n+l)-f M (n)\ 2 < 1 



[/M(n + 1) + /M(n)] - n + 1 
Taking the sum from n = to TV^ — 1 gives 



(A14) 



^ [/«("+!) + /*/(»)] "in ^TT- ln( ^ + 1) ' 

(A15) 

This approach therefore yields an upper bound on the 
variance that scales as (In N)/N 2 , rather than the 
Heisenberg limit of 1/N 2 . In fact, this is the actual scal- 
ing for these states. Evaluating the exact value of Vc 
gives 



N K 



2(1 -|(^>|) =2 1 



(N K + 1) 



£y n ( n + i) 



— + N 2 - + °( N )■ 



(A16) 



where c w 6.5949. 



APPENDIX B: DETAILS OF SHARPNESS 
FORMULA 

Here we give further details of the meaning of Eq. (|3.7p 
and its interpretation as an average over the sharpnesses 
of the individual Bayesian probability distributions ob- 
tained. A subtlety that has been omitted in the discus- 
sion in the main text is that the probability distribution 
for the measurement results is now a function of both 
the system phase and initial feedback phase, and should 
therefore be written P(u m \(f>, $i). The other feedback 
phases are all chosen deterministically based on the mea- 
surement results and $i, and can be omitted here. We 
use the notation &j to indicate the feedback phase before 
the j'th detection. The value of may then be written 
explicitly as 



— / dSx^P^I^Oe 



i(f> 



(Bl) 



Because the phases are relative, the probability of the 
measurement results depends only on the difference be- 
tween the system phase and the first feedback phase. 
That is, 



P(u m \4>,$ 1 ) = P(u m \<t>-$u0)- 



(B2) 



Similarly the probability distribution for the system 
phase will depend on $i, but will depend only on the 
difference between the two; that is, 



P(0|tZ m ,*i) = P(^-*i|iZ m ,O). 



(B3) 



In the remainder of this appendix we use the probabilities 
in this form, and omit the argument of 0. 

The later feedback phases, as well as the phase es- 
timate <f>, depend on the measurement results and the 
value of $i. Because the phases are relative, these phases 
should only depend in a linear way on the initial feedback 
phase. That is, 



(u m ,<E>i) =(f>(u m ,0) + <f>i. 



(B4) 



Using this, and changing variables to H = <f>— $i, we find 
that /i becomes 



1 

2^ 



dN^P(S m |N,0)e 



i(0(S m ,O)-N) 



(B5) 



In the following analysis we need to consider a set 
of measurement results given an actual system phase <f> 
and initial feedback phase $i, as well as the Bayesian 
probability distribution for the system phase given those 
measurement results. Since we effectively have two sys- 
tem phases, the actual system phase and the dummy 
variable for the system phase in the Bayesian probabil- 
ity distribution, we use the symbol 4>b for the variable 
in the Bayesian probability distribution. Wc also use 
Kg = 0b — Then the optimal value of (f> is 

4>(u m , $i) = *! + arg / e m *P(K B \u m )dK B 



arg 

arg 
arg(e 



^(NbI^Nb 
l ^P{4>B-®i\u m )dfc 



i<pB 



(B6) 



where the angle brackets indicate that the average is 
taken for that value of u m and $i. The integral for fx 
then becomes 



-E 

2tt 4-^ 



e jN P(M m |K)dK 



(B7) 



In this expression the integral over H can be an integral 
over the actual system phase for fixed <E>i, or an integral 
over the feedback phase $! for fixed <fi. For Eq. (|3.7p in 
the main text, we have taken <!>i = so H = <j>. 
This expression can be rewritten as 

(B8) 

The interpretation of the variables in this expression now 
changes. We now interpret this expression as the sharp- 
ness of the Bayesian probability distribution for the sys- 
tem phase given measurement results u m averaged over 
the initial feedback phase <f>i and measurement results 
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u m , for a specific system phase. To make this more clear, 
we can relabel the variables as below 

= E[|(e^}|]. (B9) 

The E indicates an expectation value over the measure- 
ment results and H = <j> — $1. This is independent of 
the actual system phase, and may therefore be regarded 
as an average over the measurement results and $1. As 
above the angle brackets indicate an average over the sys- 
tem phase in the Bayesian probability distribution, 0b- 
In the main text we simply call this c/>, as we do not 
have the actual system phase and system phase for the 
Bayesian probability distribution in the same expression. 



APPENDIX C: GENERAL MEASURE OF 
RESOURCES 

As explained in Sec. II, we adopt the general defini- 
tion of N in a system-independent way as the size of 
the smallest interval supporting the Fourier transform of 
the family of states ((/>)). Here \ip((f>)) is interpreted 
as follows. The entire measurement scheme may be re- 
garded as preparation of a pure quantum state, followed 
by a unitary operation that depends on </>, and finally 
a measurement. The preparation of the state and the 
^-dependent unitary operation may be grouped together 
into preparation of the ^-dependent state \ijj((t>)}- In gen- 
eral there will be restrictions on both the input state (for 
example, limited photon number) and the unitary oper- 
ation (for example, a limited number of passes through 
the phase shift) that will limit the possible states \ip(cf>)). 

Taking the Fourier transform of the family of states 
\ip((j>)) yields 



1 



/2tt 



(CI) 



The states |^(s)) are not (necessarily) normalized. We 
consider the case where restrictions on the measurement 
scheme imply that the family \ip(s)} has support on a 
finite interval [A m i n , A max ], and so may be represented 
as 



1 



e u +\j,{8))da, (C2) 



We define N as the smallest value of A max — A m j n . 
There are two main cases that we can consider: 



1. |V»(s)) has support on a discrete set {sj} G 
[A min ,A max ], with every s; - Sj being integer- 
valued, 



2. ma y be nonzero for any s € [A m i n , A max ]. 

In the first case, \ip{^>)) is periodic in <j> with period 27r. 
Then is a phase, and the variance may be measured 
via the Holevo variance. In the second case there is no 
periodicity imposed on \ip((j))). The appropriate measure 
of the variance of <p is then the usual variance. 

The first thing to notice is that the states \4>{s)) may 
be taken to be orthogonal without affecting the minimum 
variance that may be obtained. This can be proven as 
follows. Writing these states in some basis of orthogonal 
states 



|V>(0)> 



1 



'2it 



N„ 



1>(8,C)\0dCds. (C3) 



Now we can construct the modified state \tp'(<fi)) by 
adding an additional subspace with orthogonal states \s). 



WW) = 



1 



27T 



1>(8,Q\0dCd8. (C4) 



This modified state is not necessarily normalized. Per- 
forming a measurement of this new subspace via projec- 
tions |0)(0|, with 



If) 



1 



V2W- 



Js8 



s)ds, 



(C5) 



yields 



1 

27T ./ 



(C6) 

This state is unchanged from that in Eq. (|C3[) , except for 
a shift in <j> of 0. Thus the state (</>)) can not give any 
less information about <f> than |^>(0)). In addition, the 
state {(/>)) is of the form (|C2[) . except \i/j(s)) has been 
replaced with the orthogonal states 



i>(s,0\t)dc 



(C7) 



Hence the states \tp(s)) may be replaced with orthogonal 
states with no loss of phase information. Thus, in exam- 
ining the limits to estimation of 0, we can assume the 
|?/>(s)) are orthogonal with no loss of generality. 

In the case where the frequencies {sj} are restricted to 
differ by integer values, then \ip(s)} becomes a series of 
delta functions in s. The integral (|C2|) becomes a sum 



N 

|^(0)) = e ^."^ e -^( S )| S ) 
s=0 



where 



s+e 



\i>(s'))ds' , 



(C8) 



(C9) 
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for 1 > e > 0. Assuming the \ip(s)) are orthogonal, the 
\s) are orthogonal as well. This state is then identical to 
that for a phase shift on a single mode with a maximum of 
N photons, and it has a phase uncertainty lower bounded 
by Eq. (|2.3p . which we repeat here: 



A0hl = V Vk = tan 



N 



7T 

iV' 



(CIO) 



In the case where \tp{s)} is nonzero for any value of the 
frequency s G [iV m i n , 7V max ], we may find the bound on 
the variance as follows. Without loss of generality, the 
family of states can be given by 



N 



Js<t> 



fa)\s) 



(Cll) 



where here we have normalized {|s)} as (s\s') = S(s — 
s'). In this continuous case the system is equivalent to 
position and momentum, with the frequency s equivalent 
to position, and 4> equivalent to momentum (provided we 
use units where h = 1). It is therefore clear that the 
optimal measurement for 4> is a projection in the basis 



e ls *\s) 



(C12) 



This yields a phase variance of 



V 



</»iv(0)r# , (ci3) 



where ipW is the inverse Fourier transform of ip( s )- This 
result is for </> initially completely unknown, with a flat 
prior distribution over the whole real line. 

If we apply a phase shift of Acf> to the state, then this 
simply changes the mean value of 0, but leaves the vari- 
ance unchanged. Therefore, in considering the problem 
of minimizing the variance, we may consider states with 
mean <p equal to zero without loss of generality. The min- 
imum uncertainty state then follows from the analogy of 
position and momentum. The bounds of and N on s 
arc equivalent to an infinite square well potential of size 
N . The minimum energy of a state in this infinite square 
well is E = ir 2 /{2N 2 ), where we have taken h = 1 and 
the mass equal to 1. Because <j) is equivalent to the mo- 
mentum, and we are taking the mean <p to be equal to 
zero, V = 2E, so the minimum variance is V = ir 2 /N 2 . 
This gives the lower bound on the uncertainty of 



A0 



HL 



N 



(C14) 



This result may also be found by taking the limit of the 
discrete case. 

The case where the power of e 1 ^ can only take integer 
values is relevant to interferometry and to the gate for- 
malism of Ref. 2(3]. For photons, each pass of a single 



photon through the phase shift gives a multiplication by 
a factor of e l< ^, so N max corresponds to an upper limit 
on the number of photon passes for the measurement 
scheme. It is not possible to have a negative number of 
photon passes, and normally there is no lower bound on 
the number of photon passes, so N m i n = and N is the 
maximum number of photon passes. In cases where there 
is a minimum number of photon passes, then N can also 
be taken to be the difference between the minimum and 
maximum numbers of photon passes, and yield a tighter 
bound on the phase uncertainty. 

In the gate formalism of Ref. [2(| each application of 
the phases shift is a gate of the form 



1 I 

e 



(C15) 



applied to a qubit. Each application gives multiplication 
by a factor of e 4 * for the |1) basis state, and no multipli- 
cation for the |0) basis state. Therefore an upper bound 
on the power of e 1 ^, A^ max , is given by the number of 
applications of the phase shift. The lower bound is zero, 
so N is simply the number of applications of the phase 
shift. In particular cases the details of the measurement 
scheme may mean that there are tighter bounds on the 
power. For example, for some gates the qubit may be ini- 
tialized to |1), in which case N m i n could be taken to be 
greater than zero. As in the case of photons, this would 
yield a tighter bound on the phase uncertainty. 

The third application we discuss is that of metrology 
of a Hamiltonian with an unknown parameter <f>, describ- 
able as = <pH. We consider estimation of <j> by prob- 
ing the Hamiltonian for total time T. This is the total 
time that systems evolve under the Hamiltonian, which 
can include parallel or serial evolution of systems. The 
maximum power of e 1 ^ that can be obtained is — A m j n T, 
where A m i n is the minimum eigenvalue of H . The mini- 
mum power that can be obtained is — A max T, where A max 
is the maximum eigenvalue of H. The difference between 
the upper and lower bounds on the power of e 1 ^ is there- 
fore A" = \\H ||T, where \\H\\ is the difference between the 
maximum and minimum eigenvalues of H. 

In the case of metrology of a Hamiltonian, the time 
intervals can be any real numbers, so the powers of e 1 ^ 
can take any real values within the bounds. Therefore, <j) 
may be measured as any real number, rather than mod- 
ulo 2tt, and the bound on the phase uncertainty is exactly 
tt/N = n/(\\H\\T), and is measured by the standard de- 
viation. In Ref. [43| the total probe time is ut, because 
there are v independent probes of time t. The limit to 
phase measurement precision there is again n/N, with 
N = vt\\H\\. Note that Ref. [H] uses the terminology 
"fundamental limit" for the minimum possible phase un- 
certainty, and uses the terminology "Hciscnberg limit" in 
a different sense. 
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